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Abstract .— A large class of phylogenetic networks can be obtained from trees by the addition 
of horizontal edges between the tree edges. These networks are called tree based networks. 
Reticulation-visible networks and child-sibling networks are all tree based. In this work, we 
present a simply necessary and sufficient condition for tree-based networks and prove that 
there is a universal tree based network N for each set of species such that every phylogenetic 
tree on the same species is a base of N. The existence of universal tree based network implies 
that for any given set of phylogenetic trees (resp. clusters) on the same species there exists 
a tree base network that display all of them. 
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Reticulation process refers to the transfer of genes between organisms in a way other 
than reproduction. One of the major reticulation processes is horizontal gene transfer. It has 
been considered as a highly significant form of genetic transfer among single-cell organisms 
(Doolittle, 1999; Doolittle and Bapteste, 2007; Smets and Barkay, 2006; Treangen and Rocha, 
2011). Other reticulation processes include introgression, recombination and hybridization 
(Dagan and Martin, 2006; Fontaine et ah, 2015; McBreen and Lockhart, 2006). 

A set of gene trees are usually reconciled into a phylogenetic network to model retic¬ 
ulation processes (Doolittle and Bapteste, 2007; Huson, Rupp, and Scornavacca, 2011). A 
phylogenetic network is a rooted acyclic digraph in which there is a special node of out-degree 
2 and in-degree 0 (called the root) such that all the edges are directed away from it and the 
set of in-degree 0 nodes correspond one-to-one the collection of present-day taxa under study. 
A network is binary if every node other than the root and leaves is of degree three. Clearly, 
a phylogenetic tree is a binary phylogenetic network without reticulation nodes. 

Horizontal gene transfers are naturally modeled and visualized by using a tree-based 
phylogenetic network, where the underlying base tree represents the evolution of the species 
from which genes are sampled and branches are added between tree branches to represent 
horizontal gene transfers (Smets and Barkay, 2006; Nakhleh, 2013). Surprisingly, phyloge¬ 
netic networks that are used for modeling other reticulation processes may also have the same 
topological structure, obtained from a tree by the addition of branches between tree branches, 
even if not every binary phylogenetic network shares this property (van lersel, 2013). Re¬ 
cently, Francis and Steel (2015) initiated the study of tree-based networks. In their paper, 
sufficient conditions for tree-based networks are presented. They further showed that this 
class of networks include reticulation visible networks and tree-sibling networks. 

In the present work, we answer two problems posed by Francis and Steel (2015). 
Precisely, we present a simple necessary and sufficient condition for tree-based networks. We 
also construct a universal network on X that has every tree on X as its base for any X of 
an arbitrary size. 



Tree-based Networks 


Basic Definitions 

A digraph D consists of a set of vertexes, V{D), and a collection of directed edges, E{D), 
that each connects an ordered pair of vertexes. We call {u,v) G E{D) an ontgoing edge of 
u and an incoming edge of v. For each x G V{D), the nnmber of the incoming edges of x 
is called its indegree] the nnmber of the ontgoing edges of x is call its outdegree; the snm of 
the indegree and ontdegree of x is called its degree. The indegree, ontdegree, and degree of 
X are writeen d^{x), d°{x) and d{x), respectively. 

A path from a: to ?/ in D is made np of two or more “snccessive” vertexes x = 
ui,U 2 , ■ ■ ■ ,Uk = y, where G E{D) for 1 < i < fc — 1 and k > 2. A cycle is a path 

from a node to itself. D is acyclic if it does not contain any cycle. 

A binary phylogenetic network over a set X of species is an acyclic digraph with the 
following properties: 

• There exists a nniqne vertex p snch that d''{p) = 0. It is the root of the network. The 
root is of ontdegree 2. 

• There are exactly |A| nodes £ snch that d^{£) = 1 and d°{£) = 0, corresponding one- 
to-one with the species. These nodes are called the leaves of the network. 

• All the vertexes that are neither a leaf nor the root are of degree three. They are called 
internal nodes. 

An internal node x in a binary phylogenetic network is called a tree (or speciation) 
node if d\x) = 1 and d°{x) = 2; it is called a reticulation node if d\x) = 2 and d°{x) = 1. 
Since the root is the only vertex having indegree 0 in a phylogenetic network, there is a path 
from the root to every other vertex. For two vertexes x and y, if there is a path from y to 
X, y is said to be an ancestor of x and x is said to be a descendant of y. 

A binary phylogenetic networks is shown in Fignre[^ where we draw an open branch 
entering the root, representing the least common ancestor of all the species. In rest of the 
paper, for a binary phylogenetic network A, we shall nse the following notation: 

• pn'- The root of A; 

• V(A): The set of nodes in A; 

• T{N): The set of tree nodes in A; 

• 71{N): The set of reticnlation nodes in A; 

• T(A): The set of (directed) edges in A; 

• T(A): The set of labelled leaves in A; 

• c{u): The nniqne child of n if n G 71{N), or the set of the children of n if n G T(A); 




Figure 1: A tree-based phylogenetic network (left) and a tree base of it (right). The sub¬ 
division of the base tree (middle) is a subtree of the network that can be obtained by the 
removal of the edges ei and 62 - Reticulation nodes in the network are represented by shaded 
circles, 


• p{u): The unique parent of u if u G T{N), or the set of the parents of n if n G 71{N)-, 


Tree-based networks 

Let be a network over a set of species, X. For a subset E C S{N), N — E denotes 
the subnetwork of N obtained after the removal of the edges in E. If E contains exactly an 
incoming edge for each reticulation vertex, then every non-root node in — i? is of indegree 
1 and hence is a tree. However, it may contain new leaves. N is tree-based if there exists 
E C S{N) such that N — E is a subtree having the same leaves as N. 

The network in Figure is tree-based. It has two reticulation vertexes. The edge Ci 
enters the top reticulation vertex, whereas 62 is an edge entering the other at the bottom. 
The removal of these two edges results in a subtree with the same leaves as the network. On 
the other hand, the network in Figure is not tree-based. The reason is that no matter 
which of the incoming edges (ri,r 4 ) and (r 2 ,r 4 ) is removed for r 4 , the tail of the removed 
edge becomes a new leaf in the resulting subtree. 

Tree-based networks compose of a large class of interesting networks. A vertex in a 
phylogenetic network is called visible (or stable) if there exists a leaf such that every path 
from the network root to the leaf passes through the vertex. A network is reticulation visible 
if every reticulation vertex is visible. Reticulation visible networks are tree-based (Francis 
and Steel, 2015; Gambette et al, 2015). 

A phylogenetic network is tree sibling if every reticulation vertex has a tree vertex 
sibling. Tree sibling networks are also tree based (Francis and Steel, 2015). 






Main Results 


A necessary and sufficient condition for tree-based networks 

In a binary phylogenetic network, a reticulation vertex is said to be of: 

• type-0 if if its parents are both a reticulation vertex; 

• type-1 if a parent is a reticulation vertex and the other is a tree vertex; 

• type-2 if its parents are both a tree vertex. 

In the network drawn in Fig|^., the vertexes ri,r 2 and are of type-2, is of type-1, 
and r 4 is of type-0. A tree-based network must not contain any type-0 reticulation vertexes 
(Francis and Steel, 2015). 

Let A be a binary phylogenetic network without type-2 vertexes. Setting TZ{N) = 
{ri,r 2 , •■■j'Ts}, we dehne an undirected bipartite graph B{N) = (A U Y,E) as follows: 

X = {xi, X 2 ,... ,Xs \ Xi represents for each i}, 

U = { 2 / 1 , y 2 , ■ ■ ■ ,yt \ Vi represents a parent in T{N) of a vertex in 71{N) }, 


and 

E = {{yj, Xi) I the vertex represented by yj is a parent of the vertex by Xi in N.}- 

Remark that B{N) is essentially a bipartite subgraph of N. For example. Figure]^ shows 
the bipartite network dehned for the network in Figure [^, in which X 4 is not connected 
with any other vertex, as the parents of r 4 are both not a tree vertex. 

Using the technique of Gambette et al. (2015), we are able to present a simple 
necessary and sufficient condition for binary tree-based phylogenetic networks. 

Lemma 1 Let N be a network without type-0 reticulation vertexes. Then N is tree-based if 
and only if for every two type -1 reticulation vertexes, their correspondences are not connected 
in Bn. 

Proof. First, we have the following two facts: 

(i) Let e = {x,y) G L{N). If a; is a reticulation vertex, then x has out-degree 0 and 
hence becomes a leaf in A — {e}. 

(ii) Let ei = (xi,j/i) G S{N) and 62 = {x 2 ,y 2 ) G S{N) such that Xi G T(A) and 
yi G 1Z{N) for z = 1, 2. If xi = X 2 , then xi becomes a leaf in A — {ei, 62 }. If z/i = 7 / 2 , then 
yi has in-degree 0 in A — {ei, 62 }. 

Each edge (t,r) in A corresponds an edge in B{N), where t G T(A) and r G 71{N). 
For a subset E C S{N) n (T(A) x 71{N)), we set Eb{n) to be the subset of edges in B{N) 
that correspond one-to-one to the edges in E. The two facts stated above imply that N — E 
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Figure 2: (A) A binary phylogenetic network N, in which reticulation vertexes are repre¬ 
sented by shaded circles. (B) The bipartite graph B{N) dehned for N. 


is a tree network with the same leaves as N if and only if is a matching covering every 
reticulation vertex in N and hence if and only if Eb{n) is a complete matching from X to T 
in B{N). 

Since B{N) is bipartite, by Hall’s theorem, there is a complete matching from X to 
Y if and only if \X'\ < |X(X')| for any X' C X, where X(X') is the set of vertexes that are 
adjacent with some vertexes in X', and so if and only if there is a complete matching from 
C n X to C n y for every connected component C in B{N). 

A vertex Xi in B{N) is of degree 1 if it corresponds a type-1 reticulation vertex; it 
is of degree 2 if it corresponds a type-2 reticulation vertex. Each vertex y in B{N) has also 
degree 1 or degree 2, as the tree vertex represented y has one or two reticulation children. 
Therefore, every connected component is either a cycle or a path in B{N). Let C be a 
connected component in B{N). If C is a cycle, C has a perfect matching from C fl X to 
C nY. If C* is a path, it contains exactly two degree-1 vertexes w' and w". There is a 
complete matching from C fl X to C fl X if and only if either w' or w" is not in X. 

Since the degree-1 vertexes in X correspond one-to-one to the type-1 reticulation 
vertexes, we conclude that N is tree-based if and only if the correspondences of every two 
type-1 reticulation vertexes are not connected in B{N). □ 

Let G 71{N). We say that they are connected by a zigzagy path if there is 
a sequence of vertexes u = xo,xi,... ,X 2 k = v such that the vertexes alternate between 
reticulation vertexes and their tree vertex parents (Figure]^. 

Recalled that B{N) is a disjoint union of paths and cycles. Obviously, each cycle 
contains only type-2 reticulation vertexes. Each type-1 reticulation vertex appears only at 
the ends of a path. Therefore, by Lemma we have the following theorem. 

Theorem 1 Let N be a binary network. N is tree-based if and only if (i) there is no type-0 
reticulation vertex in N, and (i) no two type-1 vertexes are connected by a zigzagy path. 








Figure 3: (A) A maximal zigzag path between two type-1 reticulation vertexes ri and r 2 - 
(B) A maximal zigzagy path between a type-1 reticulation vertex ri and a type-2 reticulation 
vertex r 2 . (C) A maximal zigzagy path between two type-2 reticulaiton vertexes ri and r 2 . 
(D) A zigzagy cycle consisting of only type-2 reticulation vertexes. 


Theorem [T] implies the following algorithm for determining whether a network is tree- 
based or not. 

Input A binary network A; 

1. If it contains a type-0 reticulation vertex, output “A is not tree-based”; 

2. Do until there is no unmarked type-1 reticulation vertex { 

Select an unmarked type-1 reticulation vertex u; 

If the zigzagy path starting at u terminates at an 

unmarked type-1 reticulation vertex, output “A is not tree-based”; 
else mark u; 

3. Output “A is tree-based”; 


Obviously, the above algorithm is correct. Since any two zigzagy paths are disjoint, 
it takes a linear time. 

Universal tree-based networks 

It is known that there exists a network that displays every phylogenetic tree on the 
same species (see, for example, Francis and Steel, 2015). However, a tree may be displayed, 
but not as a base, in a phylogenetic network. Therefore, the following question is posed by 
Francis and Steel: 

Does there exist a network U over X such that every phylogenetic tree over X 
is a base for U for every large set of species A? 







For |X| = 3, such a universal network exists (Francis and Steel, 2015). We shall present 
such a universal tree-based network U for every X in the rest of this section. 

Let X = {1, 2, • • ■ , m}, m > 3. The network on X is divided into the upper and 
lower parts (Figure |^) and (Figure |^)- The upper part is denoted by Fupper- It is a 
(2m — 3)-row network in which: 

• the root pu is the unique vertex in the row 1, written foi; 

• the row 2 i comprises i + 1 tree vertexes tji, • • • , for z = 1, 2,..., m — 2; 

• the row 2z -|- 1 comprises i reticulation vertexes rn, ri 2 , ■ ■ ■ ^ra for z = 1, 2,..., m — 2; 

• the edge set comprises (Figure]^): 

(middle diagonal edges) {tij,rij), (tj^+i), ri^), l<j<z, l<z<m — 2, 

(side edges) {tu, tp+pi), (fip+p, f(i+i)(i+ 2 )), 0 < z < m - 3, 

(middle vertical edges) (rjj, f(j+i)(j_|_i)), l<j<z, l<z<m — 3. 

Figure shows how the rooted binary tree (^i, (((^ 2 , f's), ^ 4 ), ^ 5 )) is displayed in 
f/upper, in which the Tth leaf counted from left corresponds to ii, z < 5. 

Lemma 2 Let m > 3 and let U 2 m-A consist of the vertexes in the top 2m — 4 rows and the 
edges between them in Uupper- Then, every phylogenetic tree T over {1, 2, • • • , m — 1} is a 
base of U 2 m-A, where the j-th leaf {counted from left ) in T is mapped to t(m- 2 )j for each j 
from 1 to m — 1. 

The lower part Flower essentially a rearrangeable network with m inputs and m 
outputs. A network with m inputs and m outputs is said to be rearrangeable if for any 
one-to-one mapping tt of the inputs to the outputs, we can construct vertex-disjoint paths 
in the network linking the Ah input and the 7r(z)th output for 1 < z < m (Leighton, 1992). 
Figure shows the vertex-disjoint paths for the mapping vr = (45312) in where 

m = 5 and vr maps z to the z-th digit inside the parentheses. 

t/jower ^ mimic of the rearrangeable network derived from the well-known even-odd 
transposition sorting process in a linear array (Leighton, 1992, page 139). The topological 
structure of L^iower slightly different for odd m and even m. We use Ri to denote the row 
z in U. 

When m is odd, is divided into 2m -|- 1 rows i? 2 m- 2 , R 2 m-i, • • ■ , Rim- 2 - 

• For each z = m — 1, m -|- 1, • • ■ , 2m — 2, i? 2 i comprises m — 1 tree vertexes tij (1 < j < 
m — 1); i? 2 i+i comprises m — 1 reticulation vertexes rij (1 < j < m — 1). 

• For each z = m, m -f 2, • • • , 2m — 3, i? 2 i comprise m — 1 tree vertexes t^ {2 < j < m); 
/? 2 i+i comprises m — 1 reticulation vertexes rij {2 < j < m). 

• The last row Rim -2 comprises m leaves labelled with j (1 < j < m) from left to right. 
For sake of convenience, the j-th leaf is denoted by t( 2 m-i)y 
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Figure 4: The universal network U with hve leaves. (A) The upper part t/upper- (B) The 
lower part f^Jower- square dot arrows represent edges between the two parts. (C) The 
display of a rooted binary tree with hve unlabeled leaves in Fupper, where the round dot 
arrows represent the removed edges. In this display, the Tth leaf (counted from left) in the 
tree is mapped to the i-th vertex in the last row. (D) The vertex-disjoint paths for the 
one-to-one mapping tt = (45312), where tt maps i to the i-th digit inside the parentheses. 













The edges in are formally presented in Appendix A. Briefly, for i = m — 1 + 2j and 

0 < j < (m — l)/2, the m — 1 vertexes in and R 2 i+i are paired and connected in a 
butterfly, as shown in Figure |^. 

For i = m + 2j and 0 < j < (m — 3)/2 the m — 1 vertexes in and R 2 i+i are also 
paired and connected in a butterfly, as shown in Figure |^. 

For 0 < j < (m —3)/2, the hrst reticulation vertex r(m-i+ 2 j)i in R 2 m+Aj-i is connected 
with the hrst tree vertex t(m+i+ 2 j)i in R 2 m+ij+i^ whereas the m-th reticulation vertex r(m+ 2 j)m 
in i? 2 m+ 4 j+i is connected with the m-th tree vertex t(m+ 2 + 2 j)m in R 2 m+ 4 :j+ 4 - 

Finally, there are also m edges between the vertexes at the bottom of Fupper and 
the corresponding vertexes on the top in which are represented by the square dot 

arrows drawn between Figure and |^. 

When m is even, the structure of is presented in Appendix A. 

Lemma 3 Let vr be any one-to-one mapping on {1, 2, • • • , m}. 

(i) When m is odd, there are m vertex-disjoint paths connecting o,nd t( 2 m-i) 7 r(j) 

(1 ^ ^ m 1) and tram and t^ 2 m—i)7r(m) ^lower’ 

(ii) Whenm is even, there arem vertex-disjoint paths connecting t(^rn-i)j o^dt( 2 m-i) 7 r(j) 
(1 < j < m) in 

Additionally, every vertex in appears in one of the m paths mentioned in (i) and (ii). 

Theorem 2 Every phylogenetic tree over X is a base for U. 

Proof. Essentially, we shall prove that for each tree T, its topological structure can be 
displayed in t/upper and the leaves are then rearranged in according to the order 

they appear in T. We just prove the theorem for odd m. (The case m is even is similar.) 

Consider a phylogenetic tree T over X = {1,2, ■■ ■ ,m}. Assume that its leaves are 
listed as ii,i 2 , - ■ ■ ,^m from left to right in T, where 1 < £j < m for each j. Then, there 
exists jo such that and ijQ+i are siblings. Let po be their parent. Then, T — 

has m — 1 leaves including po. 

By Lemma T — {£jo,£jo+i} is displayed as a base in the hrst 2m — 4 rows such 
that (i) ij (1 < J < jo) is mapped to t(m- 2 p', (h) Po is mapped to t(m-2po) (iii) 

{jo -|- 1 < j < m) is mapped to f(m- 2 ) 0 -i)- Note that all the leaves in T — are 

one-to-one assigned to the tree vertexes in R 2 m-A- 

The display of T — can be extended into a display of T only by (i) reassign 

tj to for j < jo, (ii) assign and to and t(m_i)(j„+i), (iii) reassign ij 

to for jo-\-l<j<m — 1, and (iv) assign to tmm- It can be verify that such a 

display of T does not have any dummy vertex. 

Dehne vr = (£ 1^2 • • • £m)- Clearly, vr is a one-to-one mapping over X, which maps i to 
ii. By Lemma 2, there are vertex-disjoint paths that cover every vertex and connect 
and t( 2 m-i)q {I < j < m - 1) and tmm to t( 2 m-i)£™ in htiower- Combining the display of T 
and the m disjoint paths, we conclude that T is a tree base for U. □ 


Concluding Remarks 
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Figure 5: (A) The edges between vertexes in the rows 2i, 2i + 1 and 2(i + 1) in the upper 
part h^top each i. (B) The edges between vertexes in the rows 2i and 2i + 1 in the lower 
part t/top for i = m — 1, m + 1, • • ■ , 2m — 2. (C) The edges between the vertexes in the rows 
2i and 2i + 1 in the lower part for i = m,m + 2, ■ ■ ■ , 2m — 3. Here m is odd. 


The universal tree-based network we have constructed has an important implication. 
A class of phylogenetic networks is said to be complete if every collection of phylogenetic 
trees on a set of species can be displayed in a phylogenetic network on the same species in 
the class. We use Ux to denote the universal tree-based network on X for a set X of species. 
Since every phylogenetic tree on X is displayed in Ux, the class of tree-based networks is 
complete. 

In contrast, the class of reticulation visible networks is incomplete as well as its 
subclasses such as galled trees (Wang et ai, 2001) and galled networks. In fact, since a 
reticulation visible network over X has at most 4(n — 1) reticulation vertexes (Gambette et 
ah, 2015), more than different trees on X cannot all be displayed in a reticulation 

visible phylogenetic tree simultaneously. 

The completeness suggests that tree based networks are widespread in the entire space 
of phylogenetic networks. The simple linear time algorithm for testing whether a phylogenetic 
network is tree-based or not, given here, is dehnitely useful for further examination of the 
distribution of tree based networks. 

Finally, tree-based networks are a natural model for horizontal gene transfer. They 
also compose a large complete class. Therefore, it is important to study how to reconstruct 
a tree-based network with as few reticulation vertexes as possible from a set of gene trees or 
from sequence data in future. 
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Appendix A 


Proof of Lemma 2. 

Let m > 3. Recall that Ri denotes the set of vertexes in the row i in U. Let U 2 k -2 be the 
subnetwork consisting of vertexes in the top 2{k — 1) rows in Pupper, k = 2,3, ■ ■ ■ ,m — 2. 
The leaves in U 2 k -2 are all the tree vertexes t(k-i)i, t(k-i) 2 , ■ ■ ■ ,t{k-i)k in R 2 k- 2 - Note that 
U 2 m-A = Pupper — {r(m- 2 )j I 1 < j < ni — 2}. 

For each k, a vertex in a subtree of U 2 k -2 is said to be a dummy leaf if it is not in 
the lowest level R 2 k- 2 , but has out-degree 0. We prove that every phylogenetic tree with k 
leaves is a base of U 2 k -2 by induction on k. 

When k = 2, N 2 is the unique binary tree with 2 leaves. Therefore, the statement is 

true. 

Assume the statement is true for k — 1. Let T be a phylogenetic tree with k leaves. 
The depth of a vertex in a rooted tree is dehned to be the number of edges in the 
path from the root to the vertex in the tree. The depth of the root is set to 0. Let P be the 
leaf with the largest depth in T. The sibling P' of P must be also a leaf of T. (If P' were not 
a leaf, its children would have a greater depth than P.) We use p(P) to denote the parent of 
P and P' in T. Clearly, T' = T — {P, P'} is a tree with j — 1 leaves, one of which is p(P). 

By the induction hypothesis, T' is a base of U 2 {k-i )-2 = P 2 A:- 4 - Let E' C £{U 2 k-A) 
such that U 2 k-A — E' is a subdivision of T' in which there is no dummy leaf and all the 
leaves of T' correspond one-to-one the tree vertexes t{k- 2 )j if L j L k — 2) m. the row R 2 k- 4 - 
Assume that p(P) corresponds to t(k- 2 )jo for some I < jo ^ k — 2. 

Dehne E” = {{t(k- 2 )j,r(k- 2 )j) \ j < jo} U {{t(^k- 2 )j,r^k- 2 ){j-i)) \ j > jo}- Then, 
Nk — E' — E" is a subdivision of T in which (i) there is no dummy leaf, (ii) the jth leaf 
corresponds to the jth vertex t(k-i)j in R 2 k -2 for j < jo, (iii) P and P' correspond to t(^k-i)jo 
and t(k-i){jo+i)y respectively, and (iv) the jth leaf corresponds to the jth vertex t(k-i){j+i) in 
R 2 k -2 for j > Jo + 1- 

This concludes the proof of Lemma 2. □ 

The strueture 

When m is odd, Pjower consists of the last 2m-|-l rows in U: R 2 m- 2 , R 2 m-i, • • ■ , RAm- 2 - 
For i = m —1, , 2m —2, i? 2 i comprises m —1 tree vertexes tij (1 < j < m —1). 

/?2i+i comprises m — 1 reticulation vertexes rtj (1 < J < m — 1). 

For i = m,m + 2, ■ ■ ■ , 2m — 3, i? 2 i comprises m — 1 tree vertexes Uj (2 < j < m); 
R 2 i+i comprises m — 1 reticulation vertexes m — 1 Tij (2 < j < m). 

The last row Rim -2 comprises m leaves each labelled with p p < j < m) from left 
to right. For sake of convenience, the j-th leaf is denoted by t( 2 m-i)j- 



The edges in i^iclude: 

( Vertical edges) {tij, r^), — 1, i = m — 1 + 2k, 0 < fc < (m — l)/2; 

{tij,rij), 2 < j < m, i = m + 2k, 0 < fc < (m — 3)/2; 

{rij, 2, < j <m-l, m-l<i<2m-3; 

(ra, t(i+ 2 )i), i = {m - 1) + 2k,t) < k < {m - 3)/2; 

{rim, t{i+2)m), i = m + 2k, 0 < fc < (m - 3)/2; 

(^"(2m-2)l, ^(2 m— Di); 

{Diagonal edges) (trjj), 

j = 1, 3, ■ ■ ■ ,m — 2, i = m — 1 + 2k, 0 < /c < (m — l)/2; 
{tij, ^i(j+l))) (^i(i+l)) rij), 

j = 2,4, ■ ■ ■ , m — 1, i = m + 2k, 0 < A; < (m — 3)/2; 


Finally, there are m edges connecting m vertexes at the bottom in t/upper and the corre¬ 
sponding vertexes in Fjower- 

{t(m—2)1, t)m—l)l), {t[m—2){m—l), tmm), 

{r(m— 2 )j,t(m—l){j+l)), J = 1) 2, • • • ,m 2. 

When m is even, also has 2m + 1 levels, each has m or m — 2 vertexes, as 

shown in Figure]^ 

For i = m — 1, m -|- 1, • • ■ , 2m — 3, R 2 i comprises m tree vertexes tij (1 < j < m), and 
R 2 i+i comprises m reticulation vertexes (1 < j < m). 

For i = m,m + 2, - ■ ■ , 2m — 2, comprises m — 2 tree vertexes tij {2 < j < m — 1), 
and R 2 i+i comprises m — 2 reticulation vertexes (2 < j < m — 1). 

The lasr row Rim -2 consists of m leaves with labels ^j from left to right, denoted by 

t{2m-l)j (1 < j < ni). 

f/iower contains the following edges: 

{Vertical edges) (ri^, fp+ip), 

2<j<m — 1, m — l<i< 2{m — 1); 

(^il) Ap_|_2)l), {r im , t)ii-2)m) , 
i = m — 1 + 2k, 0 < k < {m — 2)/2; 

{tij, rj(j+i)), rij), 

j = 1,3, ■ ■ ■ ,m — 1, 
i = m — 1 + 2k, 0 < /c < (m — 2)/2; 

{tij, rj(j+i)), (fjQ'+i), rij), 

J = 2,4, ,m-2, 

i = m + 2k, D < k < {m — 2)/2. 


{Diagonal edges) 


^2 

i?4 

^6 

Re 

Rio 

Ri2 

i?14 



Figure 6: A universal network U with four leaves. Here, the square dot arrows represent the 
edges between the upper and lower parts. 


Finally, there are m edges connecting the m vertexes in Fupper and the corresponding 
vertexes in 


{j'{m-2)jit{m-l){j+l))i J = 1) 2, • ■ ■ , ?Tl 2. 


Proof of Lemma 3. 


We hrst consider the case m is even. We dehne m + 1 vectors Si on {1, 2, • ■ ■ , m} 
if <i <m). We use Sfj] to denote its j-th component for 1 < j < m. 

Initially, S'o = (7r(l), 7r(2), • ■ ■ ,7r(m)). After Si is computed, we compute S'j+i as 
follows. 

When i is even, S'j+i is dehned by: 


Si+i[j] 


Si[j] if ^i[j] < + 1]; 

*S'i[i + l] if S'i[j] > + 1] 


and 


+ 1] 


Si[j + 1] if S'i[j] < + 1]; 

Si[j] if Si[j] > Si[j + 1] 



for j = 1, 3, • ■ ■ , m — 1. 

When i is odd, St+i is defined by: 


and 


Si+i[m] = Si[m], 


Si+i[j] 


Si[j] if + 1]; 

Si[j + 1] if Si[j] > Si[j + 1] 


Si[j + 1] if Si[j] < Si[j + 1]; 

Si[j] if Si[j] > Si[j + 1] 


for j = 2,4, • ■ ■ , m — 2. 

Since we emulate the odd-even transposition sorting on an array with m elements 
(Leighton, 1992, page 129), Sm = 7’'(2), • • • ,7r(m)). Using S'ds, we obtain m vertex- 

disjoint paths connecting and t(2m-i)-Kii) as follows: 

For k = m — l,m + 1, ■ ■ ■ , 2m — 3, and j = 1, 3, • • ■ , m — 1, delete the vertical edges 
and ) if [j -|- 1] and [j U 1] 

and delete the diagonal edges (tkj,rk(j+i)) and (tfcQ+i),if Sk-m+2[j] = Sk-m+i[j] and 

Sk—m+2{j'\ ~ Sk—m+l[j'\- 

For k = m,m + 2, ■ ■ ■ , 2m — 2, and j = 2,4, • ■ ■ ,m — 2, delete the vertical edges 
(tkjj^kj') and (tk{j-\-i) j ^k{j-\-i)') if Sk—m+i\J 4“ 1] and Sk—m+2\J 4~ f] 

and delete the diagonal edges (tkj,rk(j+i)) and (t^o+i),if Sk-m+2[j] = Sk-m+i[j] and 
Sk—m+2 \j ] Sk—m+1 \j ] • 

Since Sm = 7’'(2), • • • ,n{m)), the resulting m vertex-disjoint paths connect 

to t( 2 m-i) 7 r(i) and pass through every vertex in 

For the case m is odd, we can prove the statement similarly. □ 



